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We report the draft genome sequence of Bacillus thuringiensis strain LM1212, which differentiates into crystal producers or 
spore formers during the stationary phase. Availability of this genome sequence will facilitate the study of spore formation, crys- 
tal formation, cell differentiation, and evolution of B. thuringiensis. 
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J ere we report the draft genome sequence of Bacillus thurin- 
i giensis strain LM 1212, isolated from the cadaver of an Oryctes 
gigas larva in Madagascar ( 1 ) . This strain produces a typical crystal 
inclusion but presents the intriguing ability to differentiate into 
crystal producers or spore formers during the stationary phase. 
This characteristic is significantly different from that of all known 
B. thuringiensis strains which produce crystal proteins in the spo- 
rulating cells (spore formers). 

Total DNA was isolated from B. thuringiensis strain LM1212. 
Genome sequencing was performed with the 454 GS-FLX Tita- 
nium (Roche Applied Science) and Illumina Hi-Seq 2000 plat- 
forms. A total of 170,744 high-quality Roche 454 reads with an 
average read length of 400 bp were produced, providing about 
1 1 -fold coverage of the genome, while the Illumina reads provided 
1,255-fold coverage with 38 million reads of 100 bp (insert size, 
3,000 bp). After preprocessing, Roche 454 reads were assembled 
into contigs with Newbler version 2.6, and then scaffolded with 
Illumina mate-pair reads using SSPACE (2). The gaps within scaf- 
folds were closed with GapFiller (3). The open reading frames 
were identified by using Glimmer version 3.02 (4) . The tRNAs and 
rRNAs were predicted using tRNAscan-SE (5) andRNAmmer (6), 
respectively. The functions of encoding genes were annotated by 
using the NCBI nr, Swiss-Prot (7), Clusters of Orthologous 
Groups (COG) (8), KEGG (9), and InterProScan (10) databases. 

The draft genome sequence consists of 51 scaffolds and 136 
contigs with a GC content of 35.33% and a total length of 
6,062,180 bases. A total of 5,286 coding sequences (CDS) and 
73 tRNA and 5 rRNA operons were predicted. Approximately 
64.16% of all coding sequences (a total of 3,392) were assigned 
to COG, and 1,814 CDS were annotated into the 166 pathways 
by using KAAS (11). This strain harbors 7 genes encoding 
products similar to Cry proteins. Based on the amino acid se- 
quences, five delta-endotoxins in LM1212 (Cry32Val, 
Cry74Aal, Cry45Bal, Cry41Cal, and Cry32Mcl) were classified 
as novel Cry toxins by the Bt Toxin Nomenclature Committee 



(http://www.lifesci.sussex.ac.uk/home/Neil_Crickmore/Bt/) and 
four of them (Cry74Aal, Cry45Bal, Cry41Cal, and Cry32Mcl) 
showed a high degree of similarity to the cancer cell-killing protein 
parasporins (12). 

Nucleotide sequence accession numbers. This whole-genome 
shotgun project has been deposited at DDBJ/EMBL/GenBank un- 
der the accession number AYPV00000000. The version described 
in this paper is version AYPV01000000. 
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